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Abstract. We derive a necessary and sufficient condition for a sequence of quantum 
measurements to achieve the optimal performance in quantum hypothesis testing. 
We discuss what quantum measurement we should perform in order to attain the 
optimal exponent of the second error probability under the condition that the first 
error probability goes to 0. As an asymptotically optimal measurement, we propose 
a projection measurement characterized by the irreducible representation theory of 
the special linear group SL(7i). Specially, in spin 1/2 system, it is realized by the 
simultaneous measurement of the total momentum and a momentum of a specified 
direction. As a byproduct, we obtain another proof of quantum Stein's lemma. In 
addition, an asymptotically optimal mcassurcmcnt is constructed in the quantum 
Gaussian case, and it is physically meaningful. 
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1. Introduction 

Deciding the true quantum state based on the two hypotheses is called quantum 
hypothesis testing, which is one of the most fundamental problems among quantum 
information theory because of the following reasons. The difficulty derived from non- 
commutativity of matrices (operators) appears as a simple form in this problem. This 
problem can be applied to other related topics in quantum information, for example, 
quantum channel coding ^ §], distillable entanglement ||, quantum estimation 
||, quantum universal variable-length source coding [pj, and quantum coin tossing |§. 

When the null hypothesis is the tensor product of a certain quantum state p and 
the alternative hypothesis is that of another quantum state a, we sometimes focus on 
the asymptotic behaviors of the first error probability (we reject the null hypothesis 
though it is correct) and the second error probability (we accept the null hypothesis 
though it is incorrect). Hiai and Petz 0, and Ogawa and Nagaoka [|K| discussed the 
optimal second error exponent under the assumption that the first error probability 
is less than a certain constant e > 0. Combining their results, we obtain that the 
optimal second error exponent is independent of e > 0, and coincides with the quantum 
relative entropy. Hiai and Petz |J proved the direct part, i.e., the attainability of the 
quantum relative entropy, and Ogawa and Nagaoka |L(J proved the converse part i.e., 
the impossibility for surpassing the quantum relative entropy. The converse part was 
simplified by Nagaoka [Q]. In addition, the quantum relative entropy coincides with 
the optimal second error exponent under the condition that the first error probability 
asymptotically goes to 0. Moreover, Ogawa and Hayashi |TT| discussed the second error 



exponent under the constant constraint for the first error exponent. 

We divide this testing process into two parts: One is the quantum part i.e., the 
quantum measurement process. When p and a are non-commutative, the choice of 
this quantum measurement is difficult and essential. The other is the classical part 
i.e., the classical data processing. In this paper, we focus on the former process i.e., 
we study what kind of measurement is suitable in order to achieve the optimal second 
error exponent. As mentioned in section |], it is sufficient for this kind hypothesis 
testing, to discuss our quantum measurement of a certain class. We derive a necessary 
and sufficient condition for a quantum measurement to attain the optimal second error 
exponent among this class. This condition depends on the alternative hypothesis a, and 
is almost independent of the null one p. As a byproduct, we obtain another proof of 
quantum Stein's lemma. 

In our setting, the unknown state is a tensor product state, but our measurement is 
not necessarily tensor product. Therefore, in order to treat the classical part we need to 
discuss our data-processing after our measurement as a classical hypothesis testing with 
two general sources. In classical information theory, by using the information-spectrum 
method, Han [|l^, [l3j studied hypothesis testing based on such a general setting. We 
apply it to our proof of the main result, and such an application to quantum hypothesis 



testing was initiated by Nagaoka fllj, [15]]. However, this paper is organized so that 



Optimal sequence of quantum measurements 



3 



the reader can understand the statement of the main result without any knowledge of 
the information-spectrum method. A quantum version of this method was discussed 
by Nagaoka and Hayashi |T(|, but it is not treated in this paper because it is not 
directly related to this issue. This work was motivated by Nagaoka's |L4], [L5| earlier 
works. Unfortunately, these Nagaoka's papers were written in Japanese, but Nagaoka 
and Hayashi |TB| contains a part of results by Nagaoka (JTJ, |T^] . 

This paper is organized as follows. In section we formulate quantum hypothesis 
testing with tensor product states as an asymptotic problem. In order to discuss 
our quantum asymptotic setting, we prepare some non-asymptotic characterizations 
in section [3|. After these preparations, we state the main results i.e., we characterize a 
quantum measurement to attain the optimal second error exponent in section ||. We 
treat quantum Gaussian states as a special example of the infinite-dimensional case 
in section [|, while we assume that the dimension of the Hilbert space of interest is 
finite in section |. In the quantum Gaussian case, we give an asymptotically optimal 
measurement whose physical interpretation is clear. In order to prove our main theorem, 
we have to discuss a general sequence of classical information sources. Thus, in section 
|6], by using the information-spectrum method, we prepare a lemma which is applicable 
to such a general sequence, and then apply it to our issue, we prove the main theorem 
in section [7| with a help of this lemma. We use some fundamental inequalities in section 
|7|, and these inequalities are given in section [8]. Moreover we assume a fundamental fact 
in section 01 this fact is proven from a representation viewpoint in section pi 



2. Asymptotic formulation of quantum hypothesis testing 

Let Ti be the Hilbert space of interest, and S(7i) be the set of density matrices on 
7i. When we perform a measurement corresponding to POVM (Positive Operator 
Valued Measure) M = {Mj} to a system in the state p, the data obeys the probability 
p m = {pM(^ = TrMip}. In particular, the POVM M = {Mi} is called a PVM 
(Projection Valued Measure) if each Mj is a projection. In the hypothesis testing, the 
testing is described by a 2-valued POVM {M a , M r }, where M a corresponds to acceptance 
and M r corresponds to rejection. Similarly, an operator A satisfying < A < I is called 
a test, and is identified with the POVM {M a , M r } = {A, I - A}. 

Now, we study the quantum hypothesis testing problem for the null hypothesis 
H : p® n e S(H® n ) versus the alternative hypothesis H x : a® n G S(H® n ), where p m 
and a® n are the nth-tensor powers of arbitrarily given density operators p and o in 
S{7i). In the sequel, an operator A n on 7i® n satisfying < A n < I or a sequence {A n } 
of such operators, is called a test. For a test A n the probabilities of the first and the 
second are, respectively, defined by 

a n {A n ) = Trp® n (J - A n ) and (3 n {A n ) = Tra 0n A n . 

We can understand that a n (A n ) is the probability of erroneously rejecting p® n though 
p®n j g £ rue an( j p^A^ j s error probability of erroneously accepting p® n although 
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B(p\\a):= supjA 
B\p\\a) : = sup | A 



p®n j g n0 ^. ^ rue _ \/ye discuss the trade-off of the two type error probabilities for two 
n-tensor product states 

The following is known as quantum Stein's lemma. 

Theorem 1 For any 1 > e > ; the equation 

- lim -log/3*(e) = D(p\\a) : = Tr p(logp - log a) 

n— >oo Tl 

holds, where f3*(e) is the minimum second error probability under the constraint that the 
first error probability is less than e i.e., 

FJe) : = min{/3 n (A")|0 < A n < I,a n (A n ) < e}. 

The part > was proven by Hiai and Petz ||. Its infinite-dimensional case was proved 
by Petz [I7||. The part < was proven by Ogawa and Nagaoka |[LG|| . 

In order to discuss the above theorem, we define the two following quantities 

3A, lim a n {A n ) = 0,limsup-log/? n (A n ) < -A 

n~*oo n~-nx> n 

3A, limsupa„(A n ) < 1, lim sup - log /3 n (A n ) < -A 

i.e., the former is the optimal second error exponent under the constraint that the first 
error probability goes to while the later is the optimal second error exponent under 
the other constraint that the first error probability does not go to 1. Thus, quantum 
Stein's lemma (Theorem [I]) is equivalent to the following relations 

B{p\\a) = B\p\\a) = D{p\\a). 

In the following, we divide this testing process into the two parts i.e., the quantum 
part and the classical part. The quantum part corresponds to the choice of a sequence 
M := {M n } of POVMs, and the classical part does to the decision process from the 
classical data. Now, we fix a sequence M := {M n } of POVMs, and denote the data set 
of M n by fl n . Then, this classical part is formulated as the classical hypothesis testing 
with the null hypothesis P^ and the alternative hypothesis P£®n- In this setting, a 
test is described by a function T n from Q n to [0, 1] instead of an operator A n satisfying 
< A n < I. In particular, when a test T n is a test function with the support S n , this 
test is equivalent to the test with the accept region S n . Similar to B(p\\a) and B^(p\\a), 
we define 

1 



B M (p\\a) := sup <J A 



3A, lim E^l (1 - T n ) = 0, lim sup - log (T n ) < -A 



B^(p\\a) := sup (a 3A, limsupEj£(l - T n ) < 1, lim sup -logE^I (T") < -A } , 

where E^ J denotes the expectation regarding the probability P^. We can easily check 
that 

B(p\\a) = sup B%\\a) < B*(p\\a) = sup B^(p\\a). 

M:POVMs M:POVMs 
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Therefore, Theorem |1] is equivalent to the following relations 

snp B a (p\\*)= snp B^{p\\a) = D(p\\a). (1) 

M:POVMs M:POVMs 

In this paper, we focus on a sequence M := {M n } c ^ =1 of POVMs that satisfies the 
condition 

B a (p\\a) = D(p\\a), (2) 

and call such a sequence M of POVMs(PVMs) an optimal sequence of POVMs(PVMs) 
in the sense of Stein's lemma. The main issue is a characterization of a sequence 
M := {M n }^L 1 of POVMs that satisfies the condition @). As mentioned in section f§, 
our characterization of such an optimal sequence is independent of the null hypothesis p, 
and depends only on the alternative hypothesis a. Of course, in section |4], we construct 
such an optimal sequence. Indeed, if a sequence M of POVMs satisfies the condition 
(§), there exists a sequence A := {A n } of tests satisfying 

lim a n (A n ) = 0, - lim - log(3 n (A n ) = D(p\\a) - e, (3) 

for any e > 0. In the following, we assume that the dimension of Ji is finite (k) and the 
inverse a -1 of a exists. 



3. Non-asymptotic characterization of PVMs 

In order to treat the condition (0), we need some characterizations concerning PVMs in 
the non-asymptotic setting. One may think that these characterizations have no relation 
with the condition (0), but they are essential for our issue. 

A state p is called commutative with a PVM E(= {Ei}) on 7i if pEi = Eip for 
any index i. The spectral decomposition of any operator X can be regarded as a PVM 
and it is denoted by E(X). In particular, we have E(a) = E(\oga). The map Ee with 
respect to a PVM E is defined as: 

S E : p^ y^^EjpEj, 

i 

which is a linear map from the set of Hermite operators to itself. Note that the state 
Se{p) is commutative with a PVM E. The number sup^ranki?, of a PVM E = {Ei} is 
an important quantity in the following, and is denoted by w(E). Next, we focus on two 
PVMs E{= {Ei} ieI ), E{= {Ejjj^j). We write E < F if there exists a subset (F/E)i of 
the index set J such that Ei = J2je(F/E) z Fj f° r an y m dex i 6 I- If a PVM F = {Fj} 
is commutative with a PVM E = {Ei}, then we can define the PVM F x E = {FjEi}, 
which satisfies F x E > E and F x E > F, and can be regarded as the simultaneous 
measurement of E and F. 

Lemma 2 If p and a are commutative with a PVM E, then the equation 
mi{(5{A) \a{A) < e} 
= inf {(3(A) |3M : PVM, M > E. M > E(A), a(A) < e, w(M) = 1 } 

holds. 
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Proof: For any A, the relations /3(£ E (A)) = [3(A) and a(£ E (A)) = a (A) hold. 
Since the PVM E(£ E (A)) commutes the PVM E, there exists a PVM M such that 
M> E,M> E(S E (A)) andw(M) = 1. ■ 
Indeed, if a test A and a PVM M satisfy M > £/(A), the test A is performed by 
combining the quantum measurement M and a suitable data processing. Therefore, 
when p and a are commutative with a PVM E, we may discuss only PVMs M satisfying 
M > E i.e., we can restrict our tests. 



4. Main Result 



In this section, we discuss the condition @ under the assumption that there exist 
PVMs E n such that each PVM E n is commutative with states a® n and p® n and 
w(E n ) < (n + This existence is proven by the representation theory in section ^. 

Since, it follows from Lemma that we may treat only a PVM satisfying M n > E n and 
w(M n ) = 1, we obtain 

B(p\\a)= sup B*{p\\<r)= sup B*{p\\<r) 

M:POVMs M={M"}:PVMs S.t. M n >E n ,w(M n )=l 

B\p\\a)= sup B^{p\\a)= sup & a (p\\a), (4) 

M:POVMs M={M"}:PVMs S.t. M">B",w)(M")=l 

i.e., we can discuss B M (p\\a) and i?"''' M (p||cr) only of a sequence M satisfying the 
condition M n > E n , w(M n ) = 1. 

pM" 

Therefore, our main issue is the asymptotic behavior of the variable - log P M " for 
the probability distribution P^l under the condition M n > E n , w(M n ) = 1. 

Theorem 3 For any sequence M = {M n } of PVMs satisfying M n > E n , w(M n ) = 1, 
the relation 

B%\\a) = D(p\\a) (5) 

holds, if and only if the variable — -logP^™ converges to — Trplogcr in the probability 
distribution P^" . 

For example, if a PVM M n is commutative with cx® n and satisfies M n > E n , w(M n ) = 1, 
the equations 

= Tr p 0n f - log a® n - Tr p log = Tr p® n Q (log a) (n) - Tr p log <x^ 

1 2 
= — Tr p (log a — Tr p log a) (6) 

n 

hold. The equation (^) implies that the variable ^ log P*i™ converges to Tr p log a in 
probability. Therefore, it is optimal in the sense of Stein's lemma. This PVM coincides 
with the PVM proposed by Hayashifi~8||. In particular, as guaranteed in section ||, in 
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spin 1/2 system, E n x E(o~® n ) can be regarded as a simultaneous measurement of the 
total momentum and a momentum of the specified direction. As a byproduct, we can 
prove the following theorem. 

Theorem 4 Any sequence M of POVMs satisfies 

D(p\\a) > B^(p\\a). 
Theorems ^| and |] yield relation ([]]), which is equivalent to quantum Stein's lemma. 



5. Quantum Gaussian states 

In this section, we discuss a quantum hypothesis testing whose hypotheses are quantum 
Gaussian states pg on an infinite dimensional space L 2 (R): 



1 C \a-e\ 2 „ 

^= / |a)(a|e v~ d 2 a, V# G C, 
ttN Jc 



where we define the boson coherent vector |a) := e~^~ J^fclo ^fl^) anc ^ 1^) ^ s ^ ne 
number vector on L 2 (M). Now, we propose a suitable PVM. This PVM not only attains 
the optimal exponents of the second error probability, but also has an intuitive physical 
meaning. When the null hypothesis is the state pf™ and the alternative hypothesis is the 
state p®", our PVM is constructed as follows. First, we perform the unitary evolution 
as 



Second, we perform the unitary evolutions 

(5?n ^ <g>(n-l) 

The constructions of these unitary evolutions were mentioned in Appendix F of Hayashi 
0. Finally, we perform a number detection {\k)(k\}^L Q to the system whose state is 
P,/n(0-9i) an d denote the final data (this PVM) by k (Mq ). Following to section 7.1 in 
Hayashi |J and its appendixes, we obtain 



for any e > 0, and any 9 E C. Therefore, when we choose the acceptance region 




as 



\6 — 9i\ 



> el, the optimal exponent of the second error probability can 
be approximately attained. Note that this measurement depends on the alternative 
hypothesis pg 1 , and is almost independent of the null hypothesis pg . This optimality is 
guaranteed because the converse part by Ogawa and Nagaoka []T0] is valid in this case. 
Thus, it implies 

B Mei (pe \\pe 1 ) = D(p eQ \\p ei ), 
where M 01 := {M£}. 
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6. Application of information-spectrum method 



In order to prove Theorems |3] and |], we have to treat general sequences of probabilities 
having no structure like a Markov chain because the sequence of two probabilities P^| n 



and P*i„ generally have no structure. In the classical information theory, Han[[T^, ^ 
introduced the information-spectrum method in order to treat a general sequence of 
information sources. In this section, we simply review the information-spectrum method 
in classical hypothesis testing, and by using this method, we characterize B M (p\\a) and 
B^' M (p\\a). Given two general sequences of probabilities p = {p n } and q = {q n } on the 
same probability sets {fi n }, we may define the general hypothesis testing problem with 
P — {Pn} as the null hypothesis and q = {q n } as the alternative hypothesis. In this 
situation, Any classical test is described by a function T n : Q n — > [0, 1]. For any test 
T n , the error probabilities of the first and the second are, respectively, defined by 

a n (T n ):= (1 — T n ((j n ))p n (u n ), (3 n (T n ):= ^ T n (u n )q n (uj n ). 



We focus on the two following quantities 



3A 



B(p\\q) := sup |A 
B (p\\q) '■= sup < A 



lim a n (T n ) = 

n— >oo 

3 A, liminfa n (T r 



0, lim sup - log p n (T n ) < -A 



< 1, lim sup - log (5 n (T n ) < -A 

n — >no Tl 



which can be regarded as generalizations of B M (p\\a) and B^ ,M (p\\a). In the independent 
identical distributed (i.i.d.) case of p and q, as is known as Stein's lemma, these 
two values B(p\\q) and B\p\\q) coincide with the relative entropy (Kullback-Leibler 
divergence) D{p\\q). Since the relative entropy D(p\\q) is the expectation of the variable 
log - under the distribution p, these two values B(p\\q) and B\p\\q) seem related to 
the variable ^ log — under the distribution p n . In order to characterize the asymptotic 
behavior of the variable - log — under the distribution p„ , we define other two values 
by 



D(p\\q) := sup|A 
D(p\\q) :=inf (a 



lim p n 



lim p n 



UJr. 



OJr. 



1 , Pni^n) 

n q n {u n ) 

1 , Pn(uJ n ) 

n q n {un) 



< A 



> A 







As guaranteed by the following lemma, two values B(p\\q) and B(p\\q) are characterized 
by the asymptotic behavior of the variable ~ log 221 under the distribution p n . 

Lemma 5 (Han ||12|| ,Verdu [ pTQf] , Nagaoka ||14| , |15[1 ) We can show the relations 

B(p\\q) =D(p\\q) (7) 
B\p\\q)=D{p\\q)- (8) 

Defining a test T n (A) as the test with the acceptance region S n {X): 



S n (X) 



UJr, 



1 , Priori) 

-log—, — { 



> A 
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we obtain 

a n (T n (D(p\\q)-t)) ^0 (9) 

Pn(T n (D(p\\q) - e)) < e ~ n{mm - £) (10) 

liminf a n (T n (D(p1|g*) - e)) < 1 (11) 

n^oo 

/3„(T n (D(pl|g) -e)) < e -n(DW\q)-e) ^ 

for any e > 0. Thus, the tests T n (D(p\\q) — e) and T n (D{p\\q) — e) approximately attain 
the bounds D_(p\\q) and D(p\\q), respectively. 

The equation (0) was proven in Chapter 4 in Han[12|]. He referred to Verdu|l9||. The 



equation ([8]) was derived by Nagaoka |14|, [15]]. Nagaoka and Hayashi |T^] contains 



Lemma [5] and a comprehensive review of the information-spectrum method. For reader's 
convenience, we give a proof in Appendix A. 

Next, in order to apply Lemma || to the characterization of B M (p\\a) and B^' M (p\\a), 
we define 

Df\p\\a) :=D({P%:}\\ {P^}) , D"\p\\a) :=Z>({Pj£}|| 
for any sequence M of POVMs. From Lemma |5], we have 



L^fpM = B**(p\\a), D M (p\\a) = B^t 



o- 



Therefore, a sequence M of POVMs is optimal in the sense of Stein's lemma if and only 
if 

D*(p\\c) = D(p\\a). 

In the following, we discuss Df 1 (p\\a) and D M (p\\a) instead of B a (p\\a) and B^ a {p\\a). 

In this paper, we use only Lemma among several results regarding the information- 
spectrum method, and this lemma is sufficient for our current issue. This paper 
treats only an application of the classical information-spectrum method to quantum 
hypothesis testing, while Nagaoka and Hayashi [|I^] discussed a quantum analogue of the 
information-spectrum method. The references, Han [C^, |T^] and Nagaoka and Hayashi 



16j1 may be useful for the reader who is interested in other related topics concerning the 



information-spectrum method. 
7. Proof of main result 

In this section, by using Lemma |^, we prove Theorems ^ and [|. In our proofs, we use 

Lemma ^ and several fundamental inequalities given in section || 

Proof of Theorem [|: As is guaranteed by Lemma ||], it is sufficient to show 

Df\p\\a) = D(p\\a). 

First, we prove that the variable ^logP^™ converges to Trplogp in probability. We 
can calculate 

2 



^P^Wl-logP^W-Trplog, 



.9 

n 
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Tr£ A/ n(p 5 



1 log £ M n (p®») - Tr p log p] = Tr p®« f - log £ M « (p® n ) - - Tr p log p 

/ V n n 



n 



<2Trp® n -\ogS M n(p® n ) - -logp® n +2Trp® n - log p® n - Tr p log p 



n 



n 



n 



< 



8 Afe-l)log(^ + l) y + 2Tr ^n (I(logp)W -Trplog P y 



(A; - 1) log(n + 1) 



+ - Tr p (log p - Tr p log p) , 
n J n 

where the last inequality follows from Lemma [7| given in section |8|. Thus, the variable 
^ log P^|™ converges to Tr p log p in probability. 

pM" 

Since MogpC = ilogP^ - JlogP^, and L>(pH = Tr plogp - Tr ploga, the 
condition (13) is equivalent to 



Tr p log a = sup < A 



1 



inn p^;<j--io g pf 8 ; < xy oy 



(13) 



Since 
sup < A 



lim PT n 

n— >oo H 



logPi; < AJ> =0 J. <inf<!A 



lim P^:<j--logP^; >A^=0[>, 



n 



it follows from Lemma ^ that the condition (|13j) equivalent to 



Tr p log (T = sup < A 



inN A 



Inn Pj|: <j --logP^; <A^ = 



>Af n 



Inn P^|; <J--logP^; > A^ ^. 



Therefore, if and only if the variables — - log P^i™ converges to — Tr p log a in probability, 
the relation (13) holds. ■ 
Next, we give Lemma |6| with a proof, which is used in our proof of Theorem |3|. 

Lemma 6 Under the same assumption as Theorem^, we obtain 



Tr p log o > inf < A 



Inn PjC<j--logP^; >A^ = 



(14) 



Proof: We discuss the asymptotic behavior of the variable - log P*®" . From Markov 
inequality, we have 

p{X >a}< e - A{x ' p ' a \ A(X,p,a) := sup (at - log / e tx{uj) p( du) 

o<t<\ V J 

We can calculate 

A(-logP^;,PX,an) = sup ant - logTr (£ m «(p®") (^"(O)' 

o<t<i \ 



= sup ant - logTr p 0n 

0<*<1 ^ 

> sup ant - ft log w(E n ) + logTrp 0n (a®* 1 )^) 
o<t<i \ ' 

log w(E n 



sup n at — t 



0<t<l 



n 



log Tr pa 
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where the inequality > follows from Lemma |1(] given in section [8[ If a > — Trp log a, 
then there exists a real number t such that < t < 1 and log Tr ^ cr — - < a because 
lim t ^o log Tr f pa — = — Tr p log a. Therefore the inequalities 

lim inf — log Pj£ ( - - log P JT„ > a 

rwoo n ^ [ n 

> lim inf A (- log P^PX, an) > lim sup (at - 1 (fe + 1} bg(n + 1} - log Tr pa A 
= sup (at- log Trp<r - *) > t (a - logTrpa — ) >o 

0<i<l ^0 

hold, i.e., 

limP^lj-ilogP^; >aj=0. 

Thus, inequality ( [H] ) holds. ■ 
Proof of Theorem [|: From equation (|J) and Lemma [5|, it is sufficient to prove the 
inequality 

D(p||a)>D Al (p||a) 

for any sequence M = {M n } such that M n > E n and w(M n ) = 1. As shown in 
our proof of Theorem ^|, the variable ^ log P^" converges to Tr p log p in probability. 
Therefore, 



D M (p\\a) = Tr p log p + inf | A 



I. 



limP";^-ilogP^, >A}.=0 



n— >oo r n 

< Trplogp — Trploga = D(p\\a), 
where the inequality follows from (|14"D. ■ 

8. Fundamental inequalities 

In this section, we give some fundamental inequalities used in our proofs of Theorem [5] 
and Lemma 

Lemma 7 // PVMs E, M satisfy M > E and a state p is commutative with E nd 
w(E) > 3, then the inequality 

TrpQogp - log S M (p)) 2 < 4(\ogw(E)) 2 (15) 

holds. 

Proof: Define ai := Tr EipEi, pi := ^-EipE^ then the equations p = ^2 li a i p i ^£ M {p) = 
^2, i a i £M{pi) hold. Using the operator inequality (A + B) 2 < 2(A 2 + B 2 ), we have 

TrpQogp - log£ M (p)) 2 = J^ajTrp^logpi - log£ M (pi)) 2 

i 

< supTrpi(logpi - log£ M (pi)) 2 < supTrpi2 ((logp;) 2 + (log£ Af (pi)) 2 ) 

i i 

= 2sup (Trp^logp,) 2 + Ti£ M (Pi)(\og£ M (Pi)) 2 ) < 4 sup (log dim ^) 2 , 
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where the last inequality follows from Lemma |8|. We obtain (|I5D. ■ 

Lemma 8 (NagaokapO[|, Osawa pT|| ) The equation 

k 

Pi > o, = i 



max \ ^PijhgPi) 2 

i=i 



(logfc) 2 ifk>3 

, ilog ^3V + ^f log ^E?V <f* = 2 (16) 

holds. 

Its proof is given in Appendix B. 

Lemma 9 Let k be the dimension ofTi. For any state p G S(7i) and any PVM M , the 
inequality p < k£M{p) holds. 

Proof: The relations 



2 

> 



(^|(^ / (|0)(0|)A;-|0)(0|)|^) = A;^^|M#)(0|M#)- J^M^ 

8=1 1=1 

hold for V0, V^> G 7^, where the inequality follows from Schwartz' inequality about 
vectors {{ip\Mi\(j))}^ =1 , {l}* = i- Thus, we obtain \4>)(4>\ < /c£m(|0)(0|)- Any state p can 
be decomposed as p = J2i s i\ < Pi)( < Pi\- Thus, 

P = y^Sj|<fti)(0jl < y^SjfegM(|0i)(0il) = k£ M (p). 

i i 

The proof is completed. ■ 

Lemma 10 Le£ p be a state commuting the PVM E. If the PVM M satisfies M > E, 
the operator inequality 

w{Efp* > (MP))"' (17) 
holds for < t < 1 u>/ien p _1 is bounded. 

Proof: Based on the same notation as in our proof of Lemma ^|, it follows from Lemma 
| that 

P = y~]aiP% < ^aiiankEiSM^pi) < ^a i w{E)£ M {p i ) = w{E)£ M (p). 

i i i 

Since the map u — > — u~ l (0 < t < 1) is an operator monotone function in (0, oo)|[22||, 
the operator inequality (|T^) holds. ■ 

9. Relation between p® 71 , cr® n and group representation 

In this section, by using the representation theory, we prove the existence of a PVM E n 
such that the PVM E n is commutative with states a® n and p® n and w(E n ) < (n+ l) fc_1 . 
In subsection EO, for this purpose, we consider the relation between irreducible 



representations and PVMs. In subsection [L2j we discuss n-tensor product states from 



a group theoretical viewpoint and prove the desired existence. 
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9.1. group representation and its irreducible decomposition 

Let V be a finite dimensional vector space over the complex numbers C. A map n from 
a group G to the generalized linear group of a vector space V is called a representation 
on V if the map 7r is homomorphism i.e. 7r(gi)7r(g 2 ) — ^{9192)1 ^9ii92 G G. A 
subspace W of V is called invariant with respect to a representation 7r if the vector 
7f(g)w belongs to the subspace W for any vector w G W and any element g £ G. A 
representation ir is called irreducible if there is no proper nonzero invariant subspace 
of V with respect to ir. Let tti and 7r 2 be representations of a group G on Vj. and 
V 2 , respectively. The tensored representation tti © 7r 2 of G on Vi © V 2 is defined as 
(iti © 7r 2 ) ((?) = TTi(g) ® ^(fiO, and the dzreci sm representation 7i"i © 7r 2 of G on Vi © V2 
is also defined as (tti © 7r 2 ) ((?) = 7T! (c?) © n 2 (g). 

In the following, we treat a representation 7r of a group G on a finite-dimensional 
Hilbert space 7i; The following facts is crucial in the later arguments. There exists an 
irreducible decomposition H — Hi © • • • ®Hi such that the irreducible components are 
orthogonal to one another if for any element g G G there exists an element g* G G 
such that 71(g)* = ir{g*) where 71(g)* denotes the adjoint of the linear map 7r(g). We 
can regard the irreducible decomposition Ti. = Hi © • • • © Hi as the PVM {-P?-^}; =1 , 
where denotes the projection to Hi. If two representations Ti\ and 7r 2 satisfy the 
preceding condition, then the tensored representation Ti\ © 7r 2 , also, satisfies it. Note 
that, in general, an irreducible decomposition of a representation satisfying the preceding 
condition is not unique. In other words, we cannot uniquely define the PVM from such 
a representation. 



9.2. Relation between the tensored representation and PVMs 

Let the dimension of the Hilbert space H be k. Concerning the natural representation 
^sl(H) °f th e special linear group SL(H) on H, we consider its n-th tensored 
representation 7if^ n ^ '■= ttsl(w) © • • ■ © 7r SL(w) on the tensored space H® n [^3], |23]]. For 



any element g G SL(H), the relation 7r SL ( W )(g)* = itsl(h)(.9*) holds where the element 
g* G SL(H) denotes the adjoint matrix of the matrix g. Consequently, there exists an 
irreducible decomposition of Trf^) regarded as a PVM and we denote the set of such 
PVMs by Ir® n . 

From the Weyl's dimension formula ((7.1.8) or (7.1.17) in Goodman-Wallach||24||), 
the n-th symmetric tensored space is the maximum-dimensional space in the irreducible 
subspaces with respect to the n-th tensored representation k$uh) ■ ^ s dimension equals 
the repeated combination k H n evaluated by ^H n = ( n ^. k L 1 1 ) — ( n+ t _1 ) = n+xHk-i < 
(n + l)*" 1 . Thus, any element E n G Ir® n satisfies w(E n ) < (n + l)* -1 . 

Lemma 11 A PVM E n G Ir® n is commutative with the n-th tensored state p® n of any 
state p on H. 

Proof: If det p ^ 0, then this lemma is trivial from the fact that det(p) _1 p G SL(H). 
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If det p = 0, there exists a sequence {pi]°^i such that det pi ^ and pi — > p as « — > oo. 
We have pf n — ► p® n as i — > oo. Because a PVM E n e Ir® n is commutative with pf n , 
it is, also, commutative with p® n . ■ 
Therefore, the existence of a desired PVM is proven. In particular, in the spin 1/2 
system, E n corresponds to the measurement of the total momentum. Therefore, 
E n x E(o~® n ) can be regarded as a simultaneous measurement of the total momentum 
and a momentum of the specified direction. 



10. Conclusion 



We discuss quantum measurements from a viewpoint of quantum hypothesis testing. 
We characterize a sequence of quantum measurements whose second error exponent 
attains the quantum relative entropy at Theorem in the finite-dimensional case. As 
a byproduct, we give another proof of quantum Stein's lemma. This characterization 
is closely related to the irreducible decomposition of the tensored representation of the 
group SL(7i). In our proof of the main theorem, the information-spectrum method plays 
an important role. In the further research of quantum information, this method seems a 
powerful and useful technique. In addition, as a special case of the infinite-dimensional 
case, we treat the quantum Gaussian states. The photon counting measurement is used 
in the construction of our asymptotically optimal measurement, and this fact indicates 
its importance. 
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Appendix A. Proof of Lemma [5] 

We simplify D(p\\q) and D(p\\q) by D and D, respectively. 
Direct part > of (ffi): For any e > 0, we have 



a n (T n (D - e)) = p n (S n {D_ - e) c ) = p n \u 



- log — - — f < D - e > ^0 

n q n {^n) 



and 



P n (T n (D-e))=q n ^u. 

< e- n ^p n < 



1 , PnM 

— log — ; r > V — e 



n q n {uJn) 
- log 

n q n {uJn) 



> D-e \ < e~ n ^- e K 



which imply (H) and (0). Thus, 



Umsup-log^ n (T n CD-e)) < -{D-e). 

n—*oo n> 
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lim inf p n < u n 



n q n [LO r> 



D = sup | A 

For any e > 0, similarly, we have 

lim inf a n (T n (D — e)) = lim inf p n <{ u. 

P n (T n (D-e)) <e 
which imply ( |TTD and (|P2|). Thus, 



n— >oo 
-n(D-e) 



1 , Pn{u n ) — 

- log — ) — J - < D - e I < 1 



\immi-logp n (T n (D-e)) < -(D - e). 

n— +oo Tl 

Converse part < of (^): Assume that a n (T n ) — > as n — > oo and 
limsup-log/3 n (T n ) = -A 
For any e > 0, the inequality 

a n (T n (R - e)) + e n ^/3 n {T n (R - e)) = 1 + ^(e"( R - e )g n K) - p n (ou n ))T n (R - e)K) 



< 



1 + ^(e^- e )g n M - p n K))^ n K) = a n (T") + e n <*- £ >/3 n (T") 



(A.l) 



holds. Since the RHS goes to and e n(fl " e) [3 n {T n (R - e)) > 0, the relation 



Pn \ u n 



1 . PnM 

log ; < R — € 



a n (T n (R-e))^0 



n q n (^n) 
holds. It implies that R — e < .D. 

Converse part < of (^): Assume that liminf^oo a n (T n ) < 1 and 

limsup-log/3 n (T n ) = -R. 

n— >oo Vh 



(A.2) 



For any e > 0, from ( |A.1[ ) and ( |A.2| ), we have 



lim inf p n { uj n 



1 - Pn{0J n ) 



log- 



n q n (u n ) 
It implies that R — e < D. 



< i?- 



liminfa n (T n (.R-e)) < liminf a„(T n ) < 1. 



Appendix B. Proof of Lemma ^ 

In the cases k = 2,3, the equation fliTf ) is checked by a calculation. Now, we prove (Pp 
by induction in the case > 4. Let be the RHS of (|16|). The inequality a fc > (logfc) 2 
is trivial. From the assumption of the induction, if = Yli=i Pii.^°&Pi) 2 ^ then > 
(i = 1, . . . , k). Using Lagrange multiplier method, we have (logpj) 2 + 2 \ogpi — A' = 0, 
where A' is the Lagrange multiplier. The solution is written by log pi = — 1 ± A, where 
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A := vl + A'. Without loss of generality, we can assume that there exists < r < k 
such that 



log Pi 



-1 + A if r > i 
-1 — A if r < i. 



Since YliiVi = 1, we have 

1 = re - 1+x + (k — r)e^- x , 
which is equivalent to the quadratic equation 



rx 



ex + k — r = 0, 



where x := e A . Since the discriminant is greater than 0, we have 

e 2 - 4r(k - r) > 0, 
which is solved as: 



k-Vk 2 - e 2 k + Vk 2 - e 2 
r < , < r. 



2 

x— y/x 2 — e 2 



(B.l) 



is monotone decreasing in (e, oo), and c(4) < 1. Thus, 



The function c(x) :- 
the condition (p.l| ) implies that r = or k. Thus, we have pi = 1/k i.e., ([IB]) . 
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